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Abstract. When fc is a constant at least 3, a sequence S of positive integers is called 
fc-GP-free if it contains no nontrivial /e-term geometric progressions. Beiglbock, Bergelson, 
Hindman and Strauss first studied the existence of a fc-GP-free sequence with bounded gaps. 
In a previous paper the author gave a partial answer to this question by constructing a 6- 
GP-free sequence S with gaps of size 0(exp(61ogn/loglogn)). We generalize this problem 
to allow the gap function fc to grow to infinity, and ask: for which pairs of functions (h, fc) 
do there exist fc-GP-free sequences with gaps of size 0(h)l We show that whenever (fc(n) — 
3) log h(n) log log h(n) > 4 log 2-log n and h, fc satisfy mild growth conditions, such a sequence 
exists. 


1. Introduction 

Let S be an increasing sequence of positive integers. We say that S is fc-GP-free if it 
contains no fc-term geometric progressions with common ratio not equal to 1, where fc > 3 
for the problem to be nontrivial. Let fc be a nondecreasing function N — > M + . We say that 
a sequence S has gaps of size 0(h) if there exists a constant C > 0 such that for every pair 
m, A e N with m < N, the sequence S intersects the interval [m, m + Ch(N)). 

The maximal asymptotic density of a fc-GP-free sequence is well-studied [3, 10, 11, 15]. 
Beiglbock et al. [2] originally posed the related question: 

Problem 1 . Does there exist fc > 3 and a fc-GP-free sequence S such that S has gaps of 
size 0(1)? 


The standard example of a 3-GP-free sequence is the sequence Q of positive squarefree 
numbers 1, 2, 3, 5, 6 , 7,10,..., which has asymptotic density 4 . Despite its large density, the 
size of its largest gaps is not known. The best unconditional result available is that of Filaseta 
and Trifonov [5] that Q has gaps of size 0(N 1//5 log A), and Trifonov also established a gen¬ 
eralization that the sequence of fc-th-power-free numbers has gaps of size 0(N 1 ^ 2k+l ' ) log A) 
[16]. Assuming the abc conjecture, Granville showed that the gaps of Q are of size 0(N e ) 
for all e > 0 [7]. 

All of these bounds can be improved immensely if we assume the conjecture of Cramer 
that the gaps between consecutive primes are 0(log 2 A) [4], For a discussion of Cramer’s 
model and implications, see the article of Pintz [12]. The problem of bounding largest gaps 
between consecutive primes, both from above and below, is notoriously difficult, and the 
best known lower bound is 


Pn+1 Pn ^ 


C log p n log log p n log log log log p n 
log log log p n 


for some C > 0 and infinitely many n, due to Ford, Green, Konyagin, Maynard, and Tao 
[6], an improvement by log log log p n over the longstanding bound of Rankin [14], The best 
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unconditional upper bound is p n +i ~ Pn = 0(N°' 525 ), due to Baker, Harman, and Pintz [1], 
with 0(N x G log N) possible assuming the Riemann hypothesis. 

Instead of pursuing these notoriously difficult problems, in a previous paper the au¬ 
thor showed that by replacing Q by a randomly constructed analogue, we can improve on 
Granville’s bound unconditionally. 

Theorem 2. [8] There exists a 6-GP-free sequence T and a constant C > 0 such that the 
gaps of T are of size 0(exp(G log IV/ log log IV)). In fact C can be taken to be any positive 
real greater than flog 2. 

In this paper we generalize the Problem 1 as follows. Henceforth k is no longer a constant 
but a nondecreasing function k : N —> R> 3 . We say that S is k-G P-free if for every N G N, the 
finite subsequence S' D {1, 2,..., iV} does not contain any nontrivial geometric progressions 
of length at least k(N). 

Problem 3. For which pairs of functions (h, k) do there exist h-GP-free sequences S such 
that S has gaps of size 0(h)? 

We call h the gap function and k the length function, and a pair (h, k ) feasible if such an 
S exists. Thus far we have only dealt with constant length function; in particular Theorem 
2 shows that the pair (exp (C log N/ log log N), 6) is feasible. At the other end of the spec¬ 
trum, it is trivial that (1, log N/ log 2) is a feasible pair, simply because the longest possible 
geometric progression in 1,... ,7V has length at most log N/ log 2. In the last section of this 
paper we show in fact that (1,£ log IV) is feasible for any e > 0. 

To interpolate between these two situations, we prove the following theorem, extending 
the method used in [8] to prove Theorem 2. 

For two functions /, g : N —> M + we write f = 0(g) if there exists a constant C > 0 such 
that f(n) < Cg(n ) for all n G N and / = o(g ) if for every C > 0 the inequaliy f(n) < Cg(n ) 
holds for all n sufficiently large. We also write / = Ll(g) if g = 0(f). 

Theorem 4. Let (h, k) be nondecreasing functions N —* M + such thath(n) = H((logx) 1 ^ 1 ~ log2 ' ) ) 
and for all sufficiently large n, k(n) > 5. If they satisfy 

(k(n) — 3) log h(n) log log h(n) > 4 log 2 • log n, 
for all sufficiently large n, then there exists a k-GP-free sequence T with gaps of size 0(h). 

As a corollary, if k is constant we recover Theorem 2 with a weaker constant. 

2. Preliminaries 

In this section we generalize the GP-free process of [8] to probabilistically construct a 
/c-GP-free sequence. First we simplify Theorem 4 by reducing the set of possible length 
functions k. It suffices to show the following. 

Theorem 5. If k is a nondecreasing function N —> {6, 8,...} taking on even positive integer 
values at least 6, and h : N —> M + is a function satisfying h(n) = fl((loga:) 1 ^ 1 ' log2 ' ) ), 
h(n) = o(y/n) and 

(k(n) — 2) log h(n) log log h(n) > 4 log 2 • log n, 
for all n sufficiently large, then there exists a k-GP-free sequence T with gaps of size 0(h). 
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Proof, (that Theorem 5 implies Theorem 4). Suppose Theorem 5 is true, and let k be as 
in Theorem 4. We can certainly round up k to the nearest integer to begin with. It is also 
possible to ignore the finite set of n for which k < 5, since we only care about n sufficiently 
large. If we round k down to the nearest even integer, if it originally satisfied the inequality 
of Theorem 4, then it has decreased by at most 1 uniformly, so the inequality above holds. 
Finally, if we prove the theorem for all h{n) = o(y/n), then it follows for all larger h as well, 
so we may as well assume h(n) = o(y/n). □ 


Let Gk be the family of all geometric progressions of positive integers such that if t is 
the largest term, then the length is at least k(t). Enumerate them as Gk,i in order lexico¬ 
graphically as sequences of positive integers. We assume that each Gk,i has common ratio 
Tk,i > 1 - 

Furthermore, there may be longer Gk,i containing shorter ones; let G* k denote the result 
of removing from Gk all Gk,i which contain some Gkj with j ^ i. Thus to find a k- GP- 
free sequence it suffices to construct a sequence T k missing at least one element from each 
progression in G* k . Let G* ki denote the ?'-th progression in G* k . 


Definition 6. For a nondecreasing function k : N —» {6, 8 ...}, define the fc-GP-free process 
as follows. Define an integer-sequence valued random variable Uk = {u\,U 2 , • • •) where tq G 
G ki such that if 


G*k,i ~ ( a *fr 


h -1 


, a l b i 


k—2, 


,UiC k i X ), 


then Ui is chosen from <7,7/ ‘ X cJ 2 and aib k J 2 c k J 2 1 with equal probability |. Each u, is picked 
independently of the others. Then T k is the random variable whose value is the sequence of 
all positive integers never appearing in Uk, sorted in increasing order. 

It is clear that Tk is fc-GP-free by definition, as it misses at least one term out of each 
G* ki . We now bound the probability that a given n G N lies in T k generated as above. For 
i , j > 1, let d(n; i,j ) count the number of ways to factor n = ab l G for some a, b, c G N. 


Lemma 7. For a positive integer n, the sequence T k constructed in Definition 6 contains n 
with probability 

^ n ] > 2~ d ( n M m )/‘ 2 M m )/ 2 - 1 ) 

where m is any positive integer such that any G* k t containing n in its middle two terms has 
largest term at least m. 


Proof. The inequality is equivalent to the statement that n is one of the middle two terms 
in at most d(n; — 1) progressions of G* k . We form an injective correspondence 

from progression G* k t containing n in the middle two terms to factorizations of n as n = 

a jJi{m)/2 c k{m)/2-\ jf a p ro g ress j on 


G* k . = (afif- 1 


, afif 2 Ci, 


, a i°i ) 


with bi < Ci and k' > fc(a,c \ x ) contains n as one of the middle two terms, then cer¬ 
tainly k{m ) < k!. Supposing n = afiij 2 ~ x c k ^ 2 , we map G* ki to the factorization n = 
a fjk(m)/2 c k(m)/2-l a — -k(m))/2 c (k-k(m))/2, £ _ c . an p c _ fo. Similarly if 77 = 

afi k ^cf^ 2 1 we take a = a,b\ k k ^^ 2 c ^ k fc ( m ))/ 2 ^ b = bi and c = q . It is easy to see from 
the assumptions that bi < Ci and that no progression in G* k strictly contains another that 
the correspondence above is injective, as desired. □ 
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From here we can control the total probability that T k misses an entire interval of the form 
[x, x + Ch{x)). 

Lemma 8. For a gap function h{x) = ofx 1-1 /^^ -1 ^ and a constant C > 0, the sequence 
T k constructed in Definition 6 satisfies T k ft [x, x + Ch(x)) = 0 with probability 


k(x) k(x) 


- 1 


P[T fc fl [x, x + Ch(x)) — 0] < exp ^ ^ exp f — log2 • d(n\ 

n£[x,x-\-Ch(x)) 

for all x sufficiently large. 

Proof. We first prove that the events P [T k 3 n] for n G [x, x + Ch{x)) are mutually indepen¬ 
dent whenever x is sufficiently large, ft suffices to show that no progression in G* k has both 
middle terms in the interval. Considering the difference between the two middle terms in a 
G* ki , and assuming both lie inside [x,x + Ch(x)), we have 

| a i b i ‘ q — afif cfi \ > afifi cfi 

> x/bi 

> 

where k > k(m) depends on the largest term m = ajcf^ 1 > x. It follows that assuming 
h(x) = o^x 1-1 ^ fc ^ -1 ^, for any C > 0 the middle two terms in any G* ki with largest term 

at most x are further apart than Ch{x) for any x sufficiently large. 

Thus the events corresponding to each n in the interval are mutually independent, and we 
can bound the probability involved by a product 

P[Tfc n [x, x + Ch(x)) = 0] < Y[ (l - 

n€:[x,x-\-Ch(x)) 

by Lemma 7. Since the inequality 1 — t < e~ t holds for all real t we arrive at the bound 

k(m) k(m) 


P[T fc fl [x, x + Ch(x )) = 0] < exp ( — 


ex p 

n£[x ,x-\-Ch(x)) 


— log 2 • d(n; 


-D)) 


Here each m = m(n ) can certainly be chosen as any number at most n. Thus we replace 
them all by x, arriving at the desired bound. □ 

Note that since we assumed h{x) = o(ffix) the growth condition in Lemma 8 is automati¬ 
cally satisfied. 


3. Proof of the Main Theorem 


All that remains is to give lower bounds for the sum 


S{x, h,k,C) 


E 


exp 


n€:[x,x-\-Ch) 


log 2 ■ d 


k k 
2 ’ 2 



1 


where k = k(x) and h = h(x) are functions satisfying the conditions of Theorem 5. To this 
end we break down [x, x + Ch) into two sets, one of which has few ( k/2 — l)-power divisors, 
and restrict the sum to that set. 
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Lemma 9. 

large x, 


There is a positive constant B independent of x such that for all sufficiently 
SO, h, k, C) > BCh(x) exp ( - log 2 exp () ) • 


Proof. Fix an x > 0 and write k = k(x), h = h(x). Denote by A the subset of [x,x + Ch) 
consisting of all n divisible by some p fc / 2-1 , where p < h. We can bound the size of A by 


\A\ 


( Ch \ 

~ ( p k/ 2-1 + 1 ) 

prime p<h 

< (((k/2-l)-l)Ch + o(h), 


where f is the Riemann zeta function and we used the elementary Chebyshev bound n(h) = 
o(h) on the prime-counting function n. Since k > 6 and ((t) — 1 < 1 uniformly on t > 2, 
there exists a constant B such that for x, and thus h, sufficiently large, |A| < (1 — B)Ch. 

If n ^ A, we can factor n = pf 1 ■ ■ -pf: r n' where n' is (/c/2 — l)-th power free, each on > 
k /2 — 1, and each p t > h is prime. As a result, 


- 


log n 
log h 1 


so by a smoothing argument we can bound d(n ; | — 1) subject to these assumptions, 


A k k \ f log n log n \ 

d \ H: 2’ 2 ~ V ^ 6XP l log2 ' (fc/2 - 1) log ft + '° g2 ' (*/2) log h k 
where we simply bounded the number of pairs b, c satisfying 6 fc//2_1 |n and c k ^ \n. Summing 
up over all terms in [x, x + Ch) outside A, we get 

1 1 \ (2 log 2) ■ log x' 


logn 


logn \ 


S(x, h, k, C ) > RChexp ( — log 2 exp ( ( — + 


k k — 2/ 


log h 


and finally replacing 1/k < l/(k — 2) we have the desired inequality. 
Finally, we prove Theorem 5 using Lemma 9. 


□ 


Proof, (of Theorem 5). By Lemma 8 it suffices to pick h, k such that the sum of probabilities 
P [T k fl [x, x + Ch(x)) — 0] < exp(— S(x, h, k , C)) < 1 

X>1 X>1 

for C sufficiently large, forcing the probability of finding a T with gaps 0(h) to be nonzero. 
This will hold as long as the sum converges for some fixed C\ making C large enough will 
make the sum arbitrarily small. Now, suppose that (k — 2) log h log log h > 4 log 2 • logn as 
in Theorem 5. Then, applying the inequality of Lemma 9, we have 

S(x,h,k,C) > BChexp(— log2 log h) 

> BCh 1 ~ los2 , 

and finally since h, = ^((logx) 1 ^ 1 ^ 1052 )), we get 

^exp (~S(x,h,k,C)) < x ~ BCD , 

X>1 X>1 

for some constant D > 0, so picking C for which BC > 1 gives a convergent sum. □ 
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4. Closing Remarks 

The goal of this paper was to interpolate smoothly between the two feasible pairs (h, k ) = 
(exp(CTog N/ log log iV), 6) and (h,k) = (1, logiV/ log 2), and we recover both pairs, up to 
constants, in the relation 

(k(n) — 3) log h(n) log log h(n) > 4 log 2 • log n. 

Unfortunely, when k is sufficiently close to logn, then the method of Theorem 4 fails 
because h = o^logx) 1 ^ 1-10 ® 2 )). Nevertheless, we expect all pairs (h,k) which satisfy 
this inequality to be feasible. In the case that h = 1 we can make an improvement on 
(1, logiV/log 2). 

Proposition 10. For any £ > 0, if k{n) = slog n then there exists a k-GP-free sequence T 
with gaps of size 0(1). 

Proof. We say a positive integer m is divisible by a k- th power if | m for some prime 

p, and that m is fc-free otherwise. Consider the sequence T of all k- free integers; we claim 
that its gaps are uniformly bounded. In fact, note that if | m then 

ppm) < 

e log m ■ log p < 
log p < 

and so p lies in the Rnite set of all primes less than e 1 ^ . In particular, for x sufficiently large, 
the interval [x, x + ef' e + 1) will contain at least one /c-free number. Indeed, it is easy to 
check that each p < e 1//£ contributes at most one multiple of to that interval. □ 

Further improvement in the case of h small or constant along these lines is blocked by the 
Chinese Remainder Theorem. In particular, for k = o(logn) and any constant h we can find 
infinitely many intervals [x, x + h) in which each positive integer in [x, x + h) is divisible by 
arbitrarily many k(x)~ th powers of primes. 

The probabilistic method in Definition 6 is by no means optimal, but is defined in such 
a way to guarantee the independence of events in an interval [n,n + Ch ). We expect that 
a sophisticated study of redundancies in our method can substantially improve at least the 
constant in Theorem 4. 


m 

log m 

1 

1 

e 
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